Coarse grained gather and scatter operations with applications
نویسندگان
چکیده
We introduce asymptotically optimal algorithms for gathering and scattering a small-to-moderate sized set of data on a coarse grained parallel computer. We use these operations to obtain efficient to optimal solutions to several fundamental problems in image processing and string matching (exact or approximate) for coarse grained parallel computers. © 2004 Elsevier Inc. All rights reserved.
منابع مشابه
Array-Based Reduction Operations for a Parallel Adaptive FEM
For many applications of scientific computing, reduction operations may cause a performance bottleneck. In this article, the performance of different coarseand fine-grained methods for implementing the reduction is investigated. Fine-grained reductions using atomic operations or fine-grained explicit locks are compared to the coarse-grained reduction operations supplied by OpenMP and MPI. The r...
متن کاملVector Prefix and Reduction Computation on Coarse-Grained, Distributed-Memory Parallel Machines
Vector prefix and reduction are collective communication primitives in which all processors must cooperate. We present two parallel algorithms, the direct algorithm and the split algorithm, for vector prefix and reduction computation on coarse-grained, distributed-memory parallel machines. Our algorithms are relatively architecture independent and can be used effectively in many applications su...
متن کاملPerformance Improvements of Microprocessor Platforms with a Coarse-Grained Reconfigurable Data-Path
This paper presents the performance improvements by coupling a high-performance coarse-grained reconfigurable data-path with a microprocessor in a generic platform. It is composed by computational units able to realize complex operations which aid in improving the performance of time critical application parts, called kernels. A design flow is proposed for mapping software descriptions to the m...
متن کاملSpace-efficient Mapping of 2D-DCT onto Dynamically Configurable Coarse-Grained Architectures
This paper shows an eecient design for 2D-DCT on dynamically conngurable coarse-grained architectures. Such coarse-grained ar-chitectures can provide improved performance for computationally demanding applications as compared to ne-grained FPGAs. We have developed a novel technique for deriving computation structures for two dimensional homogeneous computations. In this technique, the speed of ...
متن کاملA Tri-modal 2024 Al -B4C composites with super-high strength and ductility: Effect of coarse-grained aluminum fraction on mechanical behavior
In this study, ultrafine grained 2024 Al alloy based B4C particles reinforced composite was produced by mechanical milling and hot extrusion. Mechanical milling was used to synthesize the nanostructured Al2024 in attrition mill under argon atmosphere up to 50h. A similar process was used to produce Al2024-5%wt. B4C composite powder. To produce trimodal composites, milled powders were combined w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Parallel Distrib. Comput.
دوره 64 شماره
صفحات -
تاریخ انتشار 2004